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The  variation  of  temperature  or  of  pollutant  concentrations 
over  a  geographic  area  are  adequately  represented  by  random  fields 
Given  a  real-valued  random  field  {Z(x),x  s  ]R  }  a  basic  problem 
is  to  interpolate  Z  over  an  area  A  from  measurements  taken 
at  n  stations  x,  ,x-,,...,x  ,  when  the  distribution  of  Z  is 

only  partially  specified.  This  is  the  motivation  of  the  present 
paper. 

It  is  shown  that  if  the  joint  distributions  are  Gaussian  the 
best  linear  unbiased  predictor  is  (among  other  properties)  admis¬ 
sible  when  used  to  predict  Z  at  a  single  point,  but  inadmissible 
in  general  when  used  to  predict  the  values  of  the  field  at  sev¬ 
eral  points.  A  Stein-like  predictor  is  produced  which  is  uniforml 
better  than  the  B.L.U.E.  in  the  latter  case.  A  nonlinear  predicto 
based  on  relaxing  the  unbiasedness  condition  on  the  B.L.U.E.,  is 
also  proposed  and  sho^^m  to  be  in  some  cases  preferable. 


Key  words;  Best  linear  unbiased  predictor,  multivariate  normal 
distribution,  admissibility,  James-Stein  estimation. 
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1.  Introduction  and  summary: 

The  questions  considered  in  this  paper  arise  naturally 

when  trying  to  interpolate  a  realization  of  a  random  field 

2  ■. 

{Z(x),x  =  IR  }  from  observations  Z (x, ) , Z (x-) , . . . , Z (x  )  made 

Li,  n 

at  n  points,  for  example  interpolation  of  air  pollutant 
concentration  field  from  fixed  monitoring  stations.  The 
first  step  is  to  study  how  to  interpolate  at  a  single  point 
X  ;  and  then,  to  study  how  to  interpolate  simultaneously  at 
N  points  forming  a  fine  grid,  say,  over  an  area  of  interest. 

However  in  the  present  paper  we  study  the  abstract  pre¬ 
diction  problem,  independently  of  the  geometry  of  the  under¬ 
lying  set  of  the  random  field.  So  consider  n+1  real  valued 
random  variables  ^i'^2'***^n  ^  with  a  joint  distribu¬ 

tion  whose  covariance  matrix  is  specified  and  whose  mean 
vector  is  the  product  of  a  specified  matrix  by  an  unspecified 
p-dimensional  vector  parameter  a. 

First  of  all,  in  Section  2,  we  are  interested  in  the 
properties  of  the  best  linear  unbiased  predictor  Z  of  Z 
based  on  the  observed  values  of  Z^ ,  Z2 ,  . . . ,  Z^^  ,  when  the 

/N  2 

risk  function  is  R(a,Z)  =  E(Z-Z)  .  We  show  that  if  the 
joint  distribution  of  , Z2 , . . . ,  and  Z  is  normal,  then 
Z  is  extended  Bayes,  minimax,  admissible  and  uniformly  mini¬ 


mum  variance  unbiased. 

Then  in  Section  3  we  consider  the  problem  of  predicting 
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simultaneously  N  random  variables  '^n+2  '**''^n+N 

(instead  of  a  single  Z)  on  the  basis  of  the  observed  values 


of  Z,  ,Z-  .  Under  the  risk  function 

1  ^  n 


1*1 


we  show  that  the  pointwise  use  of  the  best  linear  unbiased 
predictor  of  Section  2  for  each  2^  ,  i  =  n+l,...,n+N  is 
not  an  admissible  procedure  in  general  under  normality  assump¬ 
tion.  This  is  done  by  constructing  a  Stein-like  predictor 

^^n+1  '^n+2  '•••'^n+N^  ^^n+1  '^n+2  "••'^n+N^*  ^ 

necessary  and  sufficient  condition  for  the  Stein-like  pre¬ 
dictor  to  be  uniformly  better  than  the  simultaneous  point- 
wise  best  linear  unbiased  predictor  (2^+1  '^n+2  '^n+N^ 
given.  It  is  derived  as  a  consequence  of  the  following  result 
(proved  in  Section  3) :  If  X  is  a  p-dimensional  normal  random 
vector  with  distribution  N(9,Ip)  and  C  is  a  fixed  N  x  p 
matrix  then  Bn  >  0  such  that  V9 


(1)  Ejlcx/i — 2— -  C0||^  <  Eilcx  -  ce|]^ 

if  and  only  if 

(2)  max  eigenvalue  of  CC'  <  j  trCC 

This  theorem  extends  the  basic  result  of  James  and 
Stein  (1961)  which  says  tha*  in  the  case  ^  N.S.C. 

for  (1)  to  hold  is  p  >  3. 


-3- 


Finally,  in  Section  4,  going  back  to  the  problem  of 
predicting  a  single  Z  ,  we  present  a  biased  non-linear 
predictor  Z*  obtained  as  a  modification  of  Z  when  the 
unbiasedness  condition  is  relaxed.  We  compute  the  risk  func¬ 
tion  of  Z*  and  conclude  that  Z*  should  be  preferred  to 
Z  in  some  cases  when  the  unspecified  vector  parameter  a 
can  be  bounded. 


f 


2,  The  best  linear  unbiased  predictor  of  a  single  random 
variable  and  its  properties  in  the  normal  case; 

Preliminaries ; 

Let  ^  denote  the  column  vector  formed  by  the  observed 

values  (Z^  ,Z2  ,...,2^^).  Then  the  assumptions  described  in 

1 

Section  1  can  be  written  as  follows; 


E  Z  =  F  '  a  ,  Cov  Z  =  K 

2 

E  Z  =  f ' a  ,  var  Z  =  a 
Cov(Z,Z)  =  k 

F  (pxn  matrix) ,  f  (pxl) ,  k  (nxn) ,  k  (nxl)  , 

2 

and  a  (scalar)  are  specified  quantities, 
a  (pxl  vector)  is  an  unspecified  vector-parameter 

The  best  linear  unbiased  predictor 

Z  a  +  X,  Z,  +  X-Z-,  +  ...  +  X  Z^  (that  we  write  in  short 
o  XI  ^  X  n  n 

Z  =  X^  +  X'^)  of  Z  is  obtained  by  solving  the  following 
system 

^  2 

!E(Z-Z)'‘  minimum  2.1.  a 

E  Z  =  f 'a  for  all  a  2.1.b 


The  minimization,  in  2.1. a  ,  a  priori  depends  on  a  ,  but 
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condition  2.1.b.  implies  =  0  ,  FX  ®  f  ,  and  E(Z-Z)^ 
independent  of  a.  The  solution,  which  is  classical,  is 

X  *  K“^k  -  K"^F' (FK“^F’)”^FK“^k  +  k"^F ' (FK"^F ' ) ”^f  2.2 

and  the  mean  square  error  of  the  best  Z  is 
E(Z-Z)^  =  Var(Z-Z)  = 

-  k'K”^k  +  (FK"^k  -  f ) ' (FK"^F ' ) (FK"^k  -  f)  2.3 

For  subsequent  use  of  these  unwieldy  formulas  we  introduce 
the  following  notation:  G  =  FK~^F'  (we  assume  it  is  non 
singular)  ,  and  <5  =  FK”^k  -  f . 

A. 

Z  can  be  given  a  simple  interpretation  (Goldberger,  1962) 
since  Z  =  F'a  +  e  where  Ee  =  0  and  Cov  e  =  K.  The  gen¬ 
eralized  least  square  estimator  of  a  based  on  £  is 
a  *  (FK  ^F ' )  ^FK  =  G  ^FK  and  the  best  linear  unbiased 
predictor  of  Z  can  be  rewritten 

Z  =  f'a  +  k'K"^[Z-F'a]  2.4 

Formula  2.4,  in  view  of  the  best  linear  predictor 

Z  =  f'a  +  k'K"^[Z-F'a]  2.5 

of  Z  when  a  is  specified,  shows  the  natural  origin  of  Z. 
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Finally  let  us  observe  that  ECZ-Z)"^  -  a  -k'K  k  +  0'G  0 

and  E(Z-Z)^  »  -  k'K“^k.  Therefore  0'g”^0  represents 

the  price  we  pay  for  not  knowing  a  ,  while  requiring  unbias- 

t  edness  of  Z.  In  Section  3  and  Section  4  we  show  how  this 

li 

added  term  in  the  mean  square  error  may  be  reduced. 

'  Properties  of  Z  in  the  normal  case; 

In  addition  to  the  assumptions  laid  out  at  the  beginning 
of  this  Section,  we  now  assume  that  Z  and  ^  are  jointly 
multivariate  normal. 

First  of  all  suppose  that  the  vector  parameter  a  has 
i  the  multivariate  normal  prior  distribution  N(a,r) 

Lemma  2.1:  Let  Z..  „  be  the  Bayes  rule  using  mean  squared 
ji  error  to  predict  Z  ,  when  we  observe  £  ,  and  when  a  -  N(a,r)  . 

It  has  the  form 


This  establishes  formula  2.6.  The  proof  of  formula  2.7  is 
omitted  because  the  calculations  are  straightforward  and 
tedious,  using  the  fact  that  ECajZ)  =  (G+r”^)  +  (G+r“^)”^FK 

Theorem  2.1;  The  best  linear  unbiased  predictor  Z  of 
Z  is  extended  Bayes  and  minimax. 

Proof:  The  risk  of  Z  as  a  function  of  a  is  R(a,Z)  = 

2  -1  -1 

a  -k'K  k+0'G  0.  This  expression  does  not  depend  on  a  , 

therefore  Z  is  an  equalizer.  Secondly,  when  all  the  eigen 
values  of  F  tend  to  +  “  ,  the  risk  of  Z  ^  ,  given  by 
2.7,  tends  to  R(a,Z).  Therefore  Z  is  extended  Bayes,  and 
since  it  is  an  equalizer  it  must  be  minimax.  ■ 

A. 

The  next  two  problems  are  ro  show  that  Z  is  admissible 
and  UMVUE.  So  now  we  abandon  the  assumption  that  a  has 
a  prior  distribution  and  return  to  the  case  when  it  is  a  free 
unspecified  vector-parameter. 

Lemma  2.2:  For  any  predictor  Z*  of  Z  based  on  ^  we  have 

E(Z*-Z)^  =  -  k'K"^k  +  E{Z*-E(ZlZ) 2.8 

Proof :  Z  -  Z*  can  be  written  as  the  sum  of  Z  -  E(Zl^)  and 
E(ZlZ)  -  Z* .  Next,  Z  -  E(Z|£)  is  uncorrelated  with  £  ; 
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hence,  by  normality  it  is  independent  of  £  ,  and  independent 
also  of  E(Z|£)  -  Z*  because  this  is  a  function  of 
Formula  2.8  follows.  ■ 

^  -1  -1 

Lemma  2.3;  a  »  G  FK  ^  is  a  sufficie^^t  estimator  for  a  , 
among  the  estimators  based  on 

This  is  a  standard  result  in  generalized  regression. 

We  may  just  mention  for  completeness  that  E(Z_la)  =  F'a  and 
Cov(^|a)  =  K  -  F'G  ^F  .  Hence  £  given  a  ,  which  is  normally 
distributed,  has  a  distribution  which  does  not  depend  on  a  . 

Theorem  2.2;  Z  is  an  admissible  predictor  of  Z. 

Proof ;  Suppose  that  Z  is  not  admissible  and  that  Z*  =  g(^) 
is  such  that 

E(Z*-Z)  ^  <  E(Z-Z)  ^  2.9 

for  all  a  with  strict  inequality  for  at  least  one  a  . 

Then  from  lemma  2 . 2  we  deduce  that 

E{Z*-E(Z| Z) }^  <  E{Z-E(Z|Z) 2.10 

for  all  a  ,  with  strict  inequality  for  at  least  one  a. 

E(Z|Z)  =  -  0'a  ,  and 

Z  =  k'K"^Z  -  0'a 
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Therefore,  pursuing  our  argToment,  we  have. 

E{k'K"^Z-g(Z) -0'a}^  <  E{0'a-0'a}^  2.11 

for  ail  a  ,  with  strict  inequality  for  at  least  one  a  . 

Then  by  lenuna  2.3  and  the  Rao-31ackwell  theorem,  the 
quantity  E{k'K  ^£-g(Z)|a}  which  is  a  function  of  a  and 
does  not  depend  on  a  ,  is  a  better  estimator  of  0'a  than 
0'a.  Or,  in  other  words,  0'a  is  not  an  admissible  esti¬ 
mator  of  0'a. 

But  this  is  a  contradiction  because  a''N(a,G'"^)  and 
therefore  it  is  elementary  to  see  that  0'a  is  admissible  for 
0'a  among  the  class  of  estimators  function  of  a  .  (See  also 
Cohen  (1965).) 

Theorem  2.3;  Z  is  the  uniformly  minimum  variance  un¬ 
biased  predictor  of  Z. 

i 

Proof ;  From  lemma  2.2  we  see  that  it  is  sufficient  t^^  show  | 

that  the  term  E{E(Z1^)-Z}^  is  uniformly  minimum  in  a  ,  i 

among  unbiased  predictors  of  Z.  | 
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unbiased  estimator  of  0'a  has  its  variance  bounded  from 
below  by  0'I(a)  ^0  where  1(a)  is  the  Fisher  information 
matrix  of  ^  on  a  .  It  turns  out  that  1(a)  =  G  .  Hence 
the  Cramer-Rao  lower  bound  is  0'g“^0  .  This  is  attained  by 
0’a  ,  because  E  0'a  =  0'a  ,  and  E{0'a  -0'a}^  =  0'Cov(a)0  =  0'G"^0 
In  conclusion  for  any  unbiased  predictor  g(Z_)  of  Z  we 
have  E{E(Z|£)  -  g(£)}^  =  E{k'K“^Z_-g  (^)  -0'a}^  ^  0'G“^0  = 
E(0'a-0'a)^  =  E{E(Z|Z)  -Z}"  for  all  a  . 
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3.  A  Stein  like  predictor  for  predicting  simulcaneously 
several  random  variables: 

Suppose  now  that  we  want  to  predict  N  random  variables 

^n+1  '^n+2  '’**'^n+N  observations  of  ,Z2  , . . .Z^. 

The  assumptions  that  we  make  here  are 


Z  -  N(F'a,K) 

Vi  -  N(f.’a,0.2)  i  =  1,2 . N 

Cov(^,Z^^^)  =  i  =  1,2,..,,N 


and  all  the  Z^'s  (the  observations  and  the  values  to  be 
predicted)  are  jointly  normal.  As  before  all  the  parameters 
appearing  in  the  distributions  are  specified,  except  for  the 
free  vector-parameter  a  . 

The  object  of  this  section  is  to  exhibit  a  predictor 
Z  of  the  type  introduced  by  Jaj  and  Stein  (1961)  to  esti¬ 
mate  the  mean  of  a  multivariate  normal  random  vector,  such 
that 


N 

E 

i=l 


E (Z  . -Z  .  ) 
^  n+i  n+i^ 


N 

E 

i=l 


E(z  _^.-z 

n+1  n+i 


for  all  values  of  a  . 

In  Section  2  we  saw  that  the  admissibility  of  Z  ultimately 

reduced  to  the  admissibility  of  0'a  to  estimate  0'a  when 
-1 

a  -  N(a,G  )  .  So  it  is  natural  that  in  the  present  section 
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the  key  result  will  be  the  inadmissibility  of  $a  to 
estimate  <5a  where  $  is  a  Nxp  matrix  satisfying  certain 
conditions.  This  is  stated  in  a  general  form  in  the  following 
theorem. 

Theorem  3.1;  Consider  X  ^  N(9,Ip)  ,  a  p-dimensional 
normal  random  vector;  let  C  be  an  Nxp  matrix  then  3n>0 
such  that  for  all  9  = 

Ell  CX^l-j^-^^  -C9||2  <EliCX-C9!!^  3.1 

if  and  only  if 

max  eigenvalue  of  CC  <  j  tr  CC  3.2 

Remark ;  Condition  (2)  implies  rank  C  ^  2.  In  particular 
if  C  is  the  identity,  condition  (2)  is  satisfied  if  and 
only  if  p  ^  3,  which  is  a  classical  fact  in  Stein-estimation 
(see  James  and  Stein  (1961)). 

Theorem  3.1  will  be  proved  after  the  following  two 
lemmas . 

Lemma  3.1;  If  X  ~  N(9,I  ),  for  any  row  vector  C  and  any 

p 

function  g  such  that  the  following  expectations  exist,  we 
have 


E(cXg(ll  X|l^  ))  = 


c9Eg(X 


2 

P+2K+2 


) 


3.3 
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and  also 

E{xx'g( )  >=  * 

80'E9(X^+2K+4> 

where  K  ~  Poisson 

For  a  proof  the  reader  is  referred  to  Stein  (1966)  where 
the  essential  technique  is  shown  in  order  to  prove  formula  3.3; 
the  same  technique  can  be  used  to  prove  formula  3.4. 

Lemma  3.2;  Under  the  condition  of  theorem  3.1,  we  have 


-2J)  -  csf  -  E||c«-e)  -  n  -SS-||^ 


-  E«-e)'  c'c(x-e)  -  2  E(x-e)'  c'c  -SL  +  e  .  3.5 

i*u  m 
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Since  for  two  vectors  in  "Sp  x'y  =  tr  yx',  expression 
(3.6)  can  be  rewritten 


tr  C  E(X-0)(X-8)’  C  -  2n  cr  C  E  C  +  tr  C  E 


XX' 


llxll 


2  -r  i\  ut  V.  E,  - C' 


3.7 


m\ 


By  Lemma  3.1  and  using  E  i  ^  and  E  — 

n-2  v2 


n 


n 


(n-2) (n-4) 


expression  3.7  is  equal  to 


tr  CC  -  2n  tr  C{l  E 


P  2 

V»-2K+2 


+  96'  E -j-i - 96'  E -2“^^= -  C  + 

, 


^+2K+4  ^+2K+2 


tr  C 


Ivfe; 


+  09’  eF-— - 1  |c' 


V»-2K+2  ^^2K+A 


tr  CC'  -  2n  tr  CC 


’  ^  P+2K  ■  IIC9|J^  ^{p+2K+2  “ 


n  tr  CC  E 


^  2  ti  ii2  1 

(P+2K)  (P+2K-2)  ^  ®  ■(P+2K+2)  (p+2K) 


■  tr  CC 


+  E  — — 
P+2K  ^  *=■  (p+2K) 


j, _ ; 

(p+2K-2)_ 


ljc0lr  (An  +  n^)  E 


(p+2K+2)(p+2K)  • 


Finally,  to  get  expression  3.5  given  in  the  statement  of 
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Lenntia  3.2  we  have  to  show 


2K 


(P+2K+2)  (p+2K)  (p+2K)(p+2K-2) 


This  is  done  as  follows; 


_ I - -  y 

(P+2K+2)  (P+2K) 


11611^  Jo 


-L.  I 

\\ef  k-i 


k 

1 

k! 

(p+2k+2)(p+2k) 

-l|9f/2 

k+1 

^  k+1 

(k+1) ! 

!  (p+2(k+i))(p+: 

r 

1  k 

G 

k! 

(p+2k)(p+2k-2) 

2K 

11011 


2  (p+2K) (P+2K-2)  ’ 


which  completes  the  proof  of  Lemma  3.2.  Now  we  can  turn  to 
the  proof  of  Theorem  3.1. 

Proof  of  Theorem  3.1;  The  key  to  the  proof  is  to  observe  that 

.2 


llceii; 

0  ll0ir 


oax 


max  eigenvalue  of  CC  . 
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First  if 

max  algenvalue  of  CC  £  cr  CC’  ,  3.8 

then  Ejjcxfl  -  — -  C9j|^  is  bounded  by 

tr  CC’  |l  -  2n  E  E  (p+2K)  (\>-2+2K) 

2K  7  7  1 

2(l-eyn  E  (p+2kUp-2+2K)  ^  (P+2K)  (p-2+2K)  )  * 

In  turn  to  bound  expression  3.9  we  need  p  ^  3  ;  but,  as 
observed  in  the  remark  on  page  12,  this  is  implied  by  3.8. 
Then  expression  3.9  is  bounded  by 

tr  CC  |l  -  2£T1  E  +  n  E  ■^+2k|  * 

Therefore  if  we  pick  n=£  ,  we  get 
=11“  ^  1  «  E  <  tr  CC'  -  Eljcx-cell=  . 

Conversely  if  max  eigenvalue  of  CC  _>  j  trCC  WS. 
can  choose  the  direction  of  9  such  that 

Jlcejii. 

I|8f 


max  eigenvalue  of  CC* 


based  on  Z 


are 
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n+i 


ki'K"^[Z-F'a] 


ki'K"^Z 


-  0^'a 


i=l,2, . . . ,N 


And  we  know  that  they  are  individually  admissible, 
that  it  is  not  the  case,  in  general,  when  they  are 
taneously,  is  the  object  of  the  following  theorem; 

Theorem  3.2;  For  i=l,2,...,N  ,  define 


a'G  a 


then  3n  >  0  such  that,  for  all  a 


N 


N 


2  "  lE(Z_,  -Z„,) 


i*l 


i=l 


n+i  n+i 


if  and  only  if 


max  eigenvalue  of  [0^'G”^0j]  < 


^  -1 
2.  0i'G  ^0. 

i=l  ^  ^ 


where  [0^'G  ^0^]  is  the  N^N  matrix  whose  (i,j) 
is  0^’G"^0j  . 

_i.  _  i 

Proof;  Since  Z  -  N(F'a,K)  we  have  K  ^Z  -  N(K  ^ 


Consider  an  orthogonal  matrix  H=(  ^1 

VH., 


The  fact 
used  simul- 

3.10 

3.11 

3.12 

element 

F’a,Ip) 


such  that 


19- 


=  u(K  ^F')  where  stands  for  the  vector 

space  generated  by  the  columns  of  Then 


i  /  H,K  ^  F'a 

HK  Z  -  N  I  ,  I 


let  M  =  ^  F'  ;  it  is  of  size  pxp  and  non  singular  since 

we  assumed  that  FK^^f'  is  of  full  rank.  Let  a  =  Ma  and 
a  =  Ma  ,  then  a  -  N (Ma,MG”^M' ) .  Let  us  show  that  MG”^m'  is 
actually  1^  ;  we  start  from 


FK  ^ 

F  ' 

1 

1 

-T  , 

.  ”  2 

FK 

)  K  F' 

1  1 

"T 

i- 

FK  F  '  -  FK 

■  H2'H2K 

therefore  M  M  =  G 

i. 

2 


But  H2K  ‘  F '  =  0  ; 

This  means  that  M  G 
In  conclusion,  we  have  ct  ^  N(a,I 


Then 


M '  M  G 


=  I 


is  orthogonal,  hence  MG~^m'  =  1^. 


From  lemma  2.2  we  can  write 


N 

I  E(Z 
1=1 


n+i 


2 


N 

lE(Z 

i=l 


n+i 


E(Z 


n+i 


+ 


N 

IE(Z 

i=l 


n+i 


E(Vi 


3.13 


the  first  term  of  the  right  hand  side  of  3.13  is 
2-  2  '  -1 

2.  {o .  -  k.  K  k.).  On  the  other  hand,  since 

i=l  1  ^  1 

*  k/  -  0^'  a  the  second  term  of  the  right  hand 

side  of  3.13  can  be  rewritten 


ZE(0.'ai-_  -  0.'  a) 
i»l  ^  \  aGa/  ^ 


•  2 


3.14 


Note  that  lla^H  =  a 'a  =  g'n'mc  =  a'ca  ;  and  let  C  be  the 

Nxp  matrix  whose  i^^  row  is  0/  M  .  Then  expression  3.14  becomes 


-  0/Ma)2  »  EllCa(l-^j  -  CaH^ 


Since  a  -  N(a,Ip)  ,  by  Theorem  3.1  we  know  that  3n  >  0  such 
that  for  all  a 


E  C  a/1  - 


-  Call^  <  EllCa-  CaH^ 


if  and  only  if  maxeigenvalue  of  cc'  <  1  tr  CC '  . 

To  finish  the  proof  let  us  observe  on  the  one  hand  that 
cc'  =  [0/  G  on  the  other  hand  (by  going  backward 

in  the  computation)  that 


El  C  a  -  C  a 


=  ,Z. 


this,  after  another  application  of  lemma  2.2,  completes  the 
proof  of  Theorem  3.2. 


based  on  Z,,Z-,...Z  . 

1  ^  n 


Coming  back  to  the  problem  of  predicting  a  single  Z  from 
the  observations  we  show  in  this  section  another  way  to 
reduce  the  extra-term  0'g”^0,  in  the  mean  squared  error  of 

A 

the  estimator  Z,  that  we  have  already  discussed  in  Section  2. 

The  idea  is  to  try  to  dispense  with  unbiasedness.  This 
leads  to  a  predictor  Z  ,  which  depends  on  the  unspecified 

a 

vector  parameter  a  .  It  is  therefore  of  no  use  itself  but 
the  predictor  denoted  symbolically  2^  ,  obtained  by  substitu- 

ci 

ting  a  for  a  in  ,  turns  out  to  be  interesting;  we  will 

denote  it  Z*. 

Preliminary  calculations  show  that  the  predictor 

/N  ''2 

Z-  =  +  ...  +  X  Z  which  minimizes  E{Z  -2)  ,  with  no 

a  X  X  Ii  II  3, 

unbiasedness  restriction  with  respect  to  a  ,  is 


i 
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the  risk  of  2. 

So  the  predictor  we  want  to  study  in  this  section  is 


Z*  =  k'K 


-1 


Z  - 


I 

0  a 


1+a  G  a 


-1 

.  a'FK  -^Z 


4.3 


as  derived  from  formula  4.1  by  substitution  of  a  for  a  . 

In  the  sequel  we  assume  that  Z  and  £  are  jointly  norm¬ 
ally  distributed. 

Theorem  4.1;  The  predictor  Z  can  be  reexpressed  as 


★ 

Z 


z  + 


0  a 

l+a'c  a 


4.4 


and  its  risk  is 


*  2  2 
E  (Z  -Z)  = 


k'K"^k  +  Var 


I  ^  '  /V  \ 

0  ^  3. 

A  ,  A 

1+a  G  a  i 


+  E 


0  a 


vl+a  G  3/ 


To  prove  Theorem  4.1  we  need  two  lemmas . 


Lemma  4.1;  Consider  X  ~  N(ji/2)  of  size  nxl;  suppose  1  is 
positive  definite.  Let  M  be  a  pxn  matrix  (p£n)  of  full 
rank,  and  v  a  pxl  vector.  Then 


E (X| MX  =  V)  =  y  -  2m'  (M2m' ) "^My  +  2m'  (MIm' ) 
Cov(X|MX  =  v)  =  2(1  -  m' (M2m')"^M2) 
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and  from  Lemma  4.2  we  have 


E(Z-Z|a)  =  X'E{Z|a)  -  E(Zla) 

=  (k’K"^-0'G"^FK"^)F'a  -  (k' k“^F ' a  -  0 ' a) 


=  0' (a-a) 


Therefore 


(Z-Z) 


01  ^ 
a 


vN  I  ^ 

1+a  G  a 


0  a 


(0'a)^  ' 

/N  f  /s 

l+a  G  a  J 


-  Covf0 ' > 

\ 


0'a  ^ 

1+a'G  a  / 


Then 

E(Z*-Z)^  =  -  k'K"^k  +  0'g"^0  +  E 


0  a  \ 

^  yv  I 

l+a  G  a  / 

Formula  4.5  follows  after  some  more  manipulation,  noting  that 
0'g”^0  =  Var  (0'a) .  ■ 

<ff  ★  2 

The  next  problem  is  to  compare  the  risks  R{a,Z  )  =  E(Z  -Z) 
and  R(a,Z)  =  E{Z-Z)^.  Let  R(Z)  =  R(a,Z)  because  it  does  not 
depend  on  a  .  Since  we  showed  in  Section  2  the  admissibility 

^  if  ^ 

of  Z  ,  Z  cannot  be  uniformly  better  than  Z  when  a  varies. 


2  Cov  ( 0  a 


0'a 


L \ 1+3  G  3 


However,  we  have  the  following  result 


Theorem  4.2;  R(0,Z*)  <  R(Z)  and  lim  R(a,Z*)  =  R(Z) 

llall^  +  » 

(I  V 

— ^  )  =  0  because  the 
1+a  G  a  / 

I  ^ 

distribution  of  a  then  is  N(0,g”^)  and  — ^  has  a  dis- 

1+a  G  a 

tribution  symmetric  about  zero.  Therefore  R(0,Z*)  = 

0  a  — ^  and  again  since  a  -  N(0,G  ) 

1+a'G  a  / 

we  have  Varf0'a  .  — ^  \  <  var(0'a)  =  0'g”^0  hence 

\  1+a'G a  / 

R(0,Z*)  <  R(Z) . 

Next  note  that  when  ||a||  -»- +  ®  the  term  — ^  tends 

1+a'G  a 

to  zero  in  probability,  and  therefore  also  in  distribution. 

And  since  there  exists  a  constant  c  such  that  Vbe]R^  , 

I 

I  0  b  I 

I  -  I  <  c  we  deduce  by  a  classical  convergence  theorem 

1+b  G  b 

that  all  the  moments  of  — ^ —  tend  to  zero.  Then  because 

1+a'G  a 

Var0'a  =  0'g  ^0  we  have 


0^  -  k'K"^k  +  Var 


lim  Var[0'a 
Hall  -*•  +  « 


0'a  \ 

1+a'G  a  / 


+  E 


0'a 


I 

1+a  G  a 


This  completes  the  proof  of  Theorem  4.2.  ■ 

We  state  briefly  some  final  results  without  proving  them, 
referring  the  interested  reader  to  the  report  of  the  author  (1979) 
The  risk  of  Z*  can  be  reexpressed  as  follows 


then  if  we  define  CqCih)  by 


Cq  (m) 


2  cov^X, 


—  1 

1+X2  / 


where  X  •'  N(rti,l),  it  is  easy  to  see  that 


R(m,m*)  =  R(m,m)  + 

n  'JV  a  / 

In  the  case  n  =  3  the  figure  below  representing  R(m,m*) 
and  R(m,m)  shows  that  if  we  know  a  priori  that 

m  6  [iAq  -  .8a,mQ  +  .8a]  for  some  known  constant  mQ  we  should 
subtract  the  constant  from  all  our  observations  and  use 

m*  instead  of  m. 


j 


rr.z.  X 


ure t  Comparison  of  the  risk  functions  of  m  and  m* . 
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UNCLASSIFIED 


The  variation  of  temperature  or  of  pollutant  concentrations 
over  a  geographic  area  are  adequately  represented  by  random 
fields.  Given  a  real-valued  random  field  (Z(x),x  a  basic 

problem  is  to  interpolate  Z  over  an  area  A  from  measurements 
taken  at  n  stations  ,X2  f . .  •  / ,  when  the  distribution  of  Z 
is  only  partially  specified.  This  is  the  motivation  of  the  present 
paper. 


It  is  shown  that  if  the  joint  distributions  are  Gaussian 
the  best  linear  unbiased  predictor^ is  (among  other  properties) 
admissible  when  used  to  predict  Z  at  a  single  point,  but  in¬ 
admissible  in  general  when  used  to  predict  the  values  of  the 
field  at  several  points.  A  Stein-like  predictor  is  produced  which 
is  uniformly  better  than  the  B.L.U.E.  in  the  latter  case.  A  non¬ 
linear  predictor,  based  on  relaxing  the  unbiasedness  condition 
on  the  B.L.U.E.,  is  also  proposed  and  shown  to  be  in 
preferable.^ 
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